CLAIMS:



1. A method for generating training data (Dt) for an automatic speech recogniser 
(2) for operating at a particular first sampling frequency (fH), comprising the following 
steps: 

deriving spectral characteristics (Sl) from audio data (Dl) sampled at a 
second frequency (fL) lower than the first sampling frequency (fH); 

extending the bandwidth of the spectral characteristics (Sl) by retrieving 
bandwidth extending information (Ibe) from a codebook (6); 

processing the bandwidth extended spectral characteristics (Sl,e) to give 
the required training data (Dt). 
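The retrieval step of claim 1 can be illustrated with a minimal sketch. The claim does not say how a codebook entry is selected, so the nearest-neighbour match by squared Euclidean distance, the `(low_part, high_part)` entry layout, and the function name `extend_bandwidth` are all assumptions made for illustration only.

```python
def extend_bandwidth(low_band, codebook):
    """Append the high-band part of the closest codebook entry to low_band.

    codebook is a list of (low_part, high_part) pairs. Selecting the entry
    by squared Euclidean distance is an assumption of this sketch.
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    # Pick the entry whose low-band part best matches the input features.
    _, high_part = min(codebook, key=lambda entry: sq_dist(entry[0], low_band))
    return list(low_band) + list(high_part)


# Hypothetical two-entry codebook: two low-band channels, one high-band channel.
codebook = [
    ([1.0, 2.0], [0.5]),
    ([4.0, 4.0], [3.0]),
]
extended = extend_bandwidth([3.8, 4.1], codebook)
```

Here the input is closest to the second entry, so that entry's high-band value is attached to the unchanged low-band features.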

2. A method according to claim 1, where the conversion of audio data (Dh, Dl) 
into sets of spectral characteristics (Sh, Sl) comprises calculating the FFT of the audio 
data (Dh, Dl) to give a set of Fourier coefficients (31) and filtering the output of the 
FFT with a filterbank (22) to give a set of filterbank power values (32). 

3. A method according to claim 2, where the conversion of audio data (Dh, Dl) 
into sets of spectral characteristics (Sh, Sl) comprises processing the FFT coefficients 
(31) or the filterbank power values (32) to give a set of log-spectral coefficients (33). 
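The conversion chain of claims 2 and 3 (Fourier coefficients, filterbank power values, log-spectral coefficients) might be sketched as follows. This is a simplified illustration: a direct DFT stands in for the FFT, the filterbank uses rectangular bands given as bin ranges, and the names `power_spectrum`, `filterbank_powers` and `log_spectral` are hypothetical.

```python
import cmath
import math

def power_spectrum(frame):
    """Return |X[k]|^2 for k = 0..N/2 using a direct DFT (stand-in for an FFT)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                    for t, x in enumerate(frame))
        spectrum.append(abs(coeff) ** 2)
    return spectrum

def filterbank_powers(spectrum, bands):
    """Sum the power spectrum over each (start, stop) bin range.

    Rectangular, non-overlapping bands are an assumption of this sketch.
    """
    return [sum(spectrum[a:b]) for a, b in bands]

def log_spectral(powers, floor=1e-10):
    """Take the natural log of each power value, floored to avoid log(0)."""
    return [math.log(max(p, floor)) for p in powers]

# A 16-sample cosine whose energy falls entirely in DFT bin 2.
frame = [math.cos(2 * math.pi * 2 * t / 16) for t in range(16)]
spec = power_spectrum(frame)
powers = filterbank_powers(spec, [(0, 3), (3, 6), (6, 9)])
log_coeffs = log_spectral(powers)
```

With this test signal, essentially all of the power lands in the first band, so the first log-spectral coefficient is far larger than the other two.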

4. A method according to any of claims 1 to 3, where the processing of bandwidth 
extended spectral characteristics (Sl,e) comprises a step of altering the spectrum to 
adjust signal properties of the audio data (Dl). 

5. A method according to claim 4, where the step of altering the spectrum to adjust 
the signal properties of the audio data (Dl) is performed in the linear domain. 

6. A method according to any of claims 1 to 5, where the derivation of the spectral 
characteristics (Sl) from audio data (Dl) is followed by a step of subtracting the mean 
spectrum from the spectral characteristics (Sl). 

7. A method for training an automatic speech recognition system (2) wherein the 
data (Dt) used for training are at least partially generated using a method according to 
any of claims 1 to 6. 

8. A method for generating a codebook (6) for extending the bandwidth of spectral 
characteristics (Sl) for audio data (Dl) sampled at a second sampling frequency (fL) to 
spectral characteristics (Sh) for a first sampling frequency (fH) higher than the second 
sampling frequency (fL), comprising the following steps for each entry of the codebook 
(6): 

deriving a first set of spectral characteristics (SCh) from audio data 
(DCh) sampled at the first sampling frequency (fH); 

performing a sampling rate transformation on the audio data (DCh) to the 
second sampling frequency (fL) and deriving a corresponding second set of 
spectral characteristics (SCl); 

generating a codebook entry (12) based on the second set of spectral 
characteristics (SCl) and augmenting the codebook entry (12) with 
additional higher frequency information from the first set of spectral 
characteristics (SCh). 

9. A method according to claim 8, where augmenting the codebook entry (12) 
comprises extracting information from the corresponding first set of spectral 
characteristics (Sh) pertaining to the frequencies above the second sampling frequency 
(fL) and attaching this information to the codebook entry (12) in the codebook (6). 
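The entry-building step of claims 8 and 9 can be sketched in a few lines. The sketch assumes that both feature sets are filterbank vectors sharing the same low-band channels, so that the high-frequency information is simply the tail of the wide-band vector. The sampling rate transformation itself is not shown, and the dictionary layout and the name `make_codebook_entry` are hypothetical.

```python
def make_codebook_entry(narrow_features, wide_features, n_low_channels):
    """Pair narrow-band features with the wide-band channels above the low band.

    narrow_features: features derived from the down-sampled audio.
    wide_features:   features derived from the same audio at the higher rate.
    n_low_channels:  number of channels both feature sets have in common
                     (an assumption of this sketch).
    """
    high_part = list(wide_features[n_low_channels:])
    return {"low": list(narrow_features), "high": high_part}


# Two shared low-band channels, two extra high-band channels.
entry = make_codebook_entry([0.9, 1.8], [1.0, 2.0, 0.3, 0.1], n_low_channels=2)
```

The low-band part serves as the lookup key when the codebook is later used, while the attached high-band part is what gets retrieved.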

10. A method according to claim 8 or 9, where the derivation of the second set of 
spectral characteristics (SCl) is followed by a background noise reduction and/or 
channel normalization step. 

11. A method according to claim 10, where the spectral characteristics (SCl) 
comprise a log-spectral representation, and the channel normalization is performed by 
subtracting the mean log spectrum from the log spectral characteristics (SCl). 

12. A method according to claim 10, where the spectral characteristics (SCl) 
comprise a linear spectral representation, and the background noise reduction is 
performed by subtracting a background noise spectrum from the linear spectral 
characteristics (SCl). 

13. A method according to claims 11 and 12, where the background noise reduction 
is performed by subtracting a background noise spectrum from the linear spectral 
characteristics (SCl), subsequently calculating the logarithm, and then subtracting the 
mean log spectrum from the log spectral characteristics. 
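The order of operations in claim 13 (noise subtraction in the linear domain, logarithm, mean log spectrum subtraction) might look like the sketch below. The flooring of negative values after subtraction, the averaging of the mean over all supplied frames, and the name `denoise_and_normalize` are assumptions that the claims do not specify.

```python
import math

def denoise_and_normalize(frames, noise, floor=1e-10):
    """Noise-subtract linear spectra, take logs, then remove the mean log spectrum.

    frames: list of linear power spectra, one list of channel values per frame.
    noise:  background noise spectrum with the same number of channels.
    floor:  lower bound applied after subtraction so the log is defined
            (an assumption of this sketch).
    """
    log_frames = []
    for frame in frames:
        cleaned = [max(p - n, floor) for p, n in zip(frame, noise)]
        log_frames.append([math.log(p) for p in cleaned])

    # Mean log spectrum per channel, taken over all frames.
    n_frames = len(log_frames)
    mean = [sum(column) / n_frames for column in zip(*log_frames)]
    return [[v - m for v, m in zip(frame, mean)] for frame in log_frames]


frames = [[3.0, 5.0], [5.0, 9.0]]
noise = [1.0, 1.0]
normalized = denoise_and_normalize(frames, noise)
```

After normalization, each channel averages to zero across frames, which is the effect the channel normalization is meant to achieve.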

14. A system (1) for generating training data (Dt) for an automatic speech 
recogniser (2) operating at a particular first sampling frequency (fH), comprising: 

a converter (3) for deriving spectral characteristics (Sl) from audio data 
(Dl) sampled at a second frequency (fL) lower than the first sampling 
frequency (fH); 

a retrieval unit (4) for retrieving bandwidth extending information for the 
spectral characteristics (Sl) from a codebook (6); 

a processing module (7) for processing the bandwidth-extended spectral 
characteristics (Sl,e) to give the required training data (Dt). 

15. A system (5) for generating a codebook (6) for extending the bandwidth of a set 
of spectral characteristics (Sl) for audio data (Dl) sampled at a second sampling 
frequency (fL) to a set of spectral characteristics (Sh) for a first sampling frequency (fH) 
higher than the second sampling frequency (fL), comprising: 

- a converter (9) for deriving a first set of spectral characteristics (SCh) 
from audio data (DCh) sampled at the first sampling frequency (fH); 

- a module (10) for performing a sampling rate transformation on the 
audio data (DCh) to the second sampling frequency (fL) and for deriving a 
corresponding second set of spectral characteristics (SCl) for the second 
sampling frequency (fL); 

- a codebook entry generator (11) for generating an entry (12) for the 
codebook (6) based on a second set of spectral characteristics (SCl) and for 
augmenting the codebook entry (12) with additional higher frequency 
information from the corresponding first set of spectral characteristics 
(SCh); 


